-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Separate forward and backwad compilation and support higher order derivatives for aot_function #856
base: gh/anjali411/1/base
Are you sure you want to change the base?
Conversation
[ghstack-poisoned]
ghstack-source-id: 3197eae34a9003563e0b56235540c17a0ba0617c Pull Request resolved: #856
…tion" [ghstack-poisoned]
ghstack-source-id: 4de63f2aff78e0575fc342e13688308c542aa62f Pull Request resolved: #856
…tion" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: 4bc7082b2b8cfc1f980b303689ab193e17ccb6c7 Pull Request resolved: #856
functorch/_src/aot_autograd.py
Outdated
@@ -140,12 +140,15 @@ def create_aot_autograd_function( | |||
compiled_fw = None | |||
compiled_bw = None | |||
num_outs = None | |||
joint_inputs = None |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
need to save these tensors in the context
…tion" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: c24ee1b8c252d9aebe99b0beb9139dd3eb223dd4 Pull Request resolved: #856
functorch/_src/aot_autograd.py
Outdated
func_code = bw_module.code.split('self, ') | ||
# print(func_code[0] + func_code[1]) | ||
exec(func_code[0] + func_code[1], globals()) | ||
f = create_aot_autograd_function(forward, bw_compiler, bw_compiler, partition_fn, aot_decompositions, grad_state) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Two questions:
- Why are we passing
forward
to create_aot_autograd_function? I would have expected us to pass bw_module.code without the self argument - What is the exec for? Are you trying to test this without the
create_aot_autograd_function
line?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- forward is the name of the function generated by running
bw_module.code
- exec executes the bw_module.code to create a backward function which is the forward for the next pass
…tion" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: 248de2f577fe3d61d4f2c40dc04570978dcc1543 Pull Request resolved: #856
…tion" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: 056ba4b15716a5e5fca0481de1d9a02c08415a63 Pull Request resolved: #856
…tion" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: 9f2c62614c3b7864aa58105a1934f4c4dc0fea60 Pull Request resolved: #856
…tion" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: 0ce1d4ab26357b8614c57d539cdb61f1ae90a25e Pull Request resolved: #856
…tion" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: f154dd1cbab518acd5890090ca081db1ec7fa20a Pull Request resolved: #856
…r order derivatives for aot_function" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: 6575ea077b6baddee3ea60a41ca8f5d75ab5238a Pull Request resolved: #856
…r order derivatives for aot_function" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: a21556926f1fc1f91f3bcd6e3075d77e99868c29 Pull Request resolved: #856
…r order derivatives for aot_function" Test Plan: Existing tests should pass [ghstack-poisoned]
ghstack-source-id: 0b78895219a89ec3841cf8b0804e1be69bfeed8a Pull Request resolved: #856
def fake_fn(primals, tangents): | ||
return fx_g_b(primals, tangents) | ||
fx_g_b = make_fx(functionalize(fake_fn))(flat_tensor_args, inp_grad_outs) | ||
saved_value_nodes = _get_saved_values(fx_g_b, saved_value_names) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unfortunately this approach doesn't always work because the newly generated fx graph may not have the same nodes as the previous graph. We need an alternate way to select nodes of interest in this new graph!
Stack from ghstack (oldest at bottom):
Test Plan: Existing tests should pass